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Weak gravitational lensing of background galaxies by foreground clusters offers an excellent op- 
portunity to measure cluster masses directly without using gas as a probe. One source of noise 
which seems difficult to avoid is large scale structure along the line of sight. Here 1 show that, by 
using standard map-making techniques, one can minimize the deleterious effects of this noise. The 
resulting uncertainties on cluster masses are significantly smaller than when large scale structure is 
not properly accounted for, although still larger than if it was absent altogether. 
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Introduction. Clusters of galaxies are powerful cosmo- 
logical probes 0. In particular, the number density of 
clusters as a function of mass and redshift depends sensi- 
tively on the cosmological growth function which in turn 
depends on the matter density and properties of the dark 
energy 0. Indeed, it is conceivable that we will learn as 
much about dark energy from clusters as from super- 
novae Q- 

While redshifts of clusters are relatively easy to obtain, 
cluster masses are much harder to pin down. A cluster 
mass can be estimated from optical richness or from its 
temperature, measured either in the X-ray or with ra- 
dio observations of the Sunyaev-Zel'dovich distortion of 
the cosmic microwave background. However, the scatter 
around any of these indicators is large jj] and depends on 
complicated physics, such as radiative transfer, star and 
galaxy formation, cooling, accretion, and feedback mech- 
anisms. In principle, gravitational lensing ^ allows for a 
more direct mass determination, since the distortions of 
background galaxies are sensitive only to the mass along 
the line of sight, not to the gas. In practice, there are 
many hurdles to overcome, most of which involve the ob- 
servations themselves. 

There is one systematic that affects lensing determina- 
tions of clusters that cannot be cured with better instru- 
ments or seeing. Large scale structure along the line of 
sight can be mistaken as being part of the cluster . As 
Hoekstra 7] has shown, the uncertainties caused by this 
noise can significantly impair our ability to estimate clus- 
ter masses. Since cluster abundances depend sensitively 
on their masses, this uncertainty clouds the possiblity of 
learning about dark energy from clusters. 

Here I show that one can partially offset the deleterious 
effects of large scale structure by accounting for it in clus- 
ter mass estimates. To demonstrate how this works, I will 
focus on a single cluster at redshift 0.3 with a Navarro, 
Frenk, and White (NFW) profile 0. The mass enclosed 
within a radius within which the average density is 200 
times as large as the critical density, M200, will be set to 
1.4 X 1O^^M0 and the concentration c to 4.64. I will as- 
sume we have ellipticities for thirty background galaxies 
per square arcminute, all at redshift one. These parame- 
ters will allow us to compare with the results of Ref. J] . 
Effects of Large scale structure. To get a feel for some 
numbers, the shear due to this cluster has magnitude 



equal to 0.02 at an angular distance 10' away from the 
center. In a one square arcminute pixel with thirty back- 
ground galaxies, the rms noise due to the galaxy uncer- 
tainties is 0.25/\/30 = 0.046, significantly larger than 
the signal. The rms noise due to large scale structure is 
of order 0.02 for a standard ACDM model. Adding the 
two sources of noise in quadrature leads to a ten percent 
increase in the noise due to large scale structure. Hoek- 
stra however showed that the situation is not that 
simple. Since the signal to noise in each pixel is small, it 
is necessary to average over many pixels with the same 
signal, in this radially symmetric case, over azimuthal 
angle at fixed radius. 

If we average over all pixels in an annulus of width 
1' at radius of 10', then the shape noise gets reduced 

— 1/2 

by a factor of Np where Np is the number of pix- 
els, approximately equal to 207r. The shape noise con- 
tribution to this angular average then should be of or- 
der 0.046/(207r)^/^ ~ 0.006, significantly smaller than 
the signal. If the noise due to large scale structure was 
completely independent from one pixel to the next, then 
it too would be reduced by the same factor, and would 
have an rms amplitude of 0.0025. Figure shows that 
shape noise does behave as expected, but the noise due to 
large scale structure is larger than anticipated by more 
than a factor of two. Apparently, large scale structure 
produces a signal (noise) which is correlated over many 
pixels. The number of independent pixels then is much 
smaller than 207r, and the contamination of the cluster 
signal from large scale structure is much more severe than 
one might naively estimate. As shown in Figure ^ the 
situation rapidly gets worse at larger angles, so Hoek- 
stra argued that measurements beyond 15' were useless 
for determining the cluster mass profile: the noise from 
large scale structure overwhlems the useful information 
in such measurements. 

The above hand-waving argument suggests a possible 
solution. Since the large scale structure signal is corre- 
lated over ~ 4', why not make use of this correlation in 
the radial direction as well? That is, when averaging over 
the azimuthal shear, one implicitly discards all informa- 
tion about the combined radial/azimuthal structure of 
the shear. A circular blob of shear with diameter 4' due 
to large scale structure might be easily detected before 
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FIG. 1: Azimuthally averaged tangential shear i- Also 
shown is the rms noise due to uncertainty in background 
galaxy shapes and due to large scale structure. 



the angular averaging and then subtracted off. If we first 
average over angles, we lose much of our power to identify 
and account for the shear due to large scale structure. 

This is illustrated in Fig. 13 where a realization of each 
source of noise is shown in the top two panels. The noise 
from large scale structure is manifestly more coherent, 
extending over larger scales than is the shape noise. Just 
as clearly, this distinction disappears (bottom panel) if 
one averages over azimuthal angle. 

In order to retain the information necessary to dis- 
tinguish large scale structure shear from cluster shear, 
we need to start from the full shear field, ji{0a), where 
i — 1,2 labels the two components of the symmetric, 
traceless shear tensor. Assuming that the shear induced 
by large scale structure is Gaussian, the likelihood of ob- 
taining the data is 



[27r] 



det(Af)i/2 



exp 



(1) 

where the index a includes both pixel position (a) and 
shear component (i) so ranges from 1 to 2Np, with Np 
now the total number of pixels for which we have ellip- 
ticities. The noise matrix N is the sum of the covariance 
due to shape noise and to large scale structure, 



N = C"''^P'= -f G 



Iss 



(2) 



From this starting point, there are a number of directions 
one can take. Here I pursue two: (i) using the likelihood 
function and the Fisher matrix derived from it to project 
uncertainties on the two parameters characterizing the 
NFW profile and (ii) compressing the information in the 
likelihood function into a mass estimator which is optimal 
in the sense that it retains all the relevant information. 
In both cases, we can compare the results with what one 
would obtain without accounting for large scale struc- 
ture. 

Parametric Fitting. Let's assume that the cluster we are 
studying has an NFW profile and ask how well we can 
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FIG. 2: Top panel: Tangential shear -yt due to a realization 
of shape noise; Middle panel: 7t from large scale structure 
in a single realization. The noise emanating from large scale 
structure is much more coherent than that due to the uncer- 
tainty in galaxy shapes. Note especially the swath of negative 
"ft in the bottom panel extending from lower left to upper 
right. Bottom Panel: The same two realizations smoothed 
over azimuthal angle. There is no longer any clear distinction 
between the two sources of noise. 



determine the parameters characterizing this profile. The 
Fisher matrix. 
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quantifies the errors on these parameters pi(Af2oo) and 
P2(c) from a lensing survey. We will consider the errors 
on M20Q and c in three cases: (i) no large scale structure, 
N = C'shapc. ^jj-j including and accounting for large scale 
structure N = (7shape_|_(^iss. g^^^ j-j-j-j ^a,rge scale structure 
present, but parameters determined from the azimuthally 
averages tangential shear 0, (jt)- In this last case, the 
data points 7^ in equation [3] are replaced by (■jt) in 
different radial bins and the noise matrix ^^{^j) consists of 
the shape noise per pixel reduced by the number of pixels 
in the azimuthal average and the azimuthally averaged 
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FIG. 3: Top panel: Projected 1-sigma errors on i\/2oo and 
concentration from ellipticities within 20' of the cluster as- 
suming 30 background galaxies per square arcminute, all at 
redshift 1. Inner ellipse assumes there is no contribution from 
large scale structure. Outer ellipse shows the increase in er- 
rors once large scale structure is included. Middle ellipse also 
includes large scale structure, but here the parameters are 
determined directly from the likelihood function, instead of 
from (74). Bottom panel: Errors on the mass of the cluster as 
a function of the maximum angle out to which ellipticities of 
background galaxies are measured. 



large scale structure covariance matrix. Incidentally, if 
no large scale structure is present (i), then using all the 
pixels is not necessary: (74) maintains all the relevant 
information. 

The top panel of Figure O shows the projected er- 
rors from a survey which measures background galaxies 
within 20' of the cluster center. The difference between 
the inner and outer ellipse reinforces the point empha- 
sized by Hoekstra 0, that the noise due to large scale 
structure degrades the parameter determination by a fac- 
tor of order two if not properly accounted for. The mid- 
dle ellipse shows though that the situation is not quite 
this dire. If one works from the likelihood function di- 
rectly and does not azimuthally average, the effects of 
large scale structure can be minimized. 

The bottom panel of Fig.Oshows the error on A/200 (af- 
ter marginalizing over the concentration) in these three 
cases as a function of the maximum distance out to which 
data is available. Here we can see that the optimal esti- 
mator (i.e. not azimuthally averaging) reduces the dele- 
terious effects of large scale structure by roughly 50% at 
least on large scales. This realization has practical impli- 
cations: is it worthwhile taking data far from the cluster 
center or does the noise due to large scale structure make 



such data irrelevant when it comes to determining the 
cluster mass? Figure |21 shows that going out to 25' leads 
to a fifteen percent smaller error on M200 as compared 
with ^max = 15'. While this is not quite as large as the 
18% gain if there was no large scale structure, it is much 
larger than five percent gain if large scale structure is not 
accounted for. 
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FIG. 4: Convergence of the cluster smoothed with a l' Gaus- 
sian. Top panel: Input profile ; Middle panel: profile re- 
covered from simulated ellipticities accounting for large scale 
structure. Bottom panel: The difference between the LSS 
estimator and the estimator which neglects LSS. 



Mapmaking. One can also try to obtain information 
about the mass distribution in a cluster by inverting 
directly without assuming any particular profile. Since 
Kaiser and Squires first introduced this idea llo|, many 
groups have worked to develop new techniques account- 
ing for real world complications |Tll | . Here I simply want 
to discuss a way to improve an estimator by accounting 
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for large scale structure. So, I will focus on one particular 
estimator [l^, the so-called optimal estimator, used re- 
cently to make maps in CMB experiments ^3 • Although 
the maps made from this estimator are not particularly 
pretty, they do retain all the information stored in the 
likelihood function. Thus, they are very useful for quan- 
titative analysis and they can be massaged in a number 
of ways (which I will not discuss here) to produce more 
realistic pictures. The main point is to see how much we 
can learn in the face of noise due to large scale structure. 

The shear due to a cluster is linearly related to the 
convergance 



(4) 



where the index a ranges over all pixels for which we are 
fitting the surface density. This presumably will be of 
order Np (but it does not have to be exactly equal to it, 
for we may choose to estimate the density in a coarser 
grid than the measured ellipticities). The kernel is 
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cos(20) i = 1 
~ sin(20) I = 2 



(5) 



where (/> is the angle between the cc-axis and the vector 

— ^ai and A is the area of a pixel. 

The measured shear is a combination of this sig- 
nal, shape noise, and projections from large scale 
structure:7a = 7^ -I- ^^^i^p" -f . The latter two con- 
tributions have mean zero and a total covariance matrix 
TV. Then the minimum variance estimator for the con- 
vergence is 

k = CnK^N~^j (6) 
with mean equal to k"^^ and covariance matrix 



N 



(7) 



Figure 01 shows the convergance of the cluster, along 
with the reconstruction accounting for large scale struc- 
ture. Each of these was obtained from a pixelized set 
of ellipticities out to a radius of 15' which included the 
signal due to the cluster, shape noise, and large scale 
structure. Fig. |21 shows the two noise sources. The map 
in the middle of Fig. 0] was obtained with the minimum 
variance estimate of equation 

The bottom panel of Fig. 0] shows the difference be- 
tween a map which accounts for large scale structure 



and one that does not (i.e., one in which N was set to 
(^shapc-j^ Note especially that the minimum variance esti- 
mator obtains greater densities along a swath extending 
from the lower left to upper right. A comparison with 
the top panel shows that the minimum variance estima- 
tor more accurately reproduces the density. It does this 
by properly accounting for large scale structure. The es- 
timator which neglects LSS treats the negative 74 swath 
from lower left to upper right in Fig. |2] as produced by 
the cluster. Thus, the total 74 it ascribes to the cluster 
is smaller than it should be. The result is an underpre- 
diction of the density along this swath. The minimum 
variance estimator avoids this pitfall. 

How much does the variance go up when one uses the 
estimator which neglects large scale structure? Fig. |S1 
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FIG. 5: Uncertainty in the mass enclosed in an annulus of 
radius 6 and width l' from estimators compared with the 
uncertainty in the absence of large scale structure. Dashed 
curve uses the equivalent of equation without accounting 
for large scale structure. 



shows the errors on the mass within an annulus of radius 
9 compared with the errors if there was no large scale 
structure. As in the parametric estimation, we see that 
the minimum variance estimator more accurately deter- 
mines the cluster mass. While the errors are larger than 
if there had been no large scale structure, they are signif- 
icantly smaller than one gets when neglecting large scale 
structure. 
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